Interactive video apparatus with audio and video branching

ABSTRACT

A system providing user interaction with a video display provided to a standard television set by a home video cassette unit employing a standard video cassette having a video signal recorded thereon, the video signal having four or more audio channels and digital commands encoded therein, the system including apparatus for generating and moving a cursor on the television screen and a digital processor responsive to commands encoded on the cassette, to the cursor position and/or to viewer selections to control the video displayed and the audio output by the television receiver.

BACKGROUND OF THE DISCLOSURE

The subject invention relates generally to an interactive accessory for use with a video display. One particular aspect of the invention is the provision of a hardware interface between the user and a video cassette recorder band playback system using prerecorded video tapes (software).

The very large and still rapidly growing installed base of video cassette recorders (VCR's) worldwide in the mid-1980s has spurred interest in interactive home video systems.

It has become apparent to the inventors that an acceptable system for interaction with a home video display requires a suitable feedback loop between the operator and television screen. Apparatus providing such feedback further must allow for interaction at a considerable distance from the television screen and should not contain visual or operational nuances which annoy, distract or otherwise obstruct or discourage use by the viewer.

It is further desirable in such a system to provide "branching," i.e., the capability for selection of one of a number of video pictures and/or audio responses based upon user decision.

As is well known, the linear nature of the presentation obtained from an ordinary consumer-type of video tape player makes rapid branching to various segments of the tape an impossibility. While many VCR's provide features such as rapid forward and reverse search modes, there are still far too slow for branching in interactive game or learning environments.

A number of reasonably satisfactory solutions leading to rapid visual branching have nevertheless been proposed in the prior art and actually used with some success in experimental or commercial equipment. However, a successful interactive system requires not only visual branching but audio branching as well. Again, the availability of two audio tracks in some VCR's would permit two-way branching. Such branching proves marginally adequate in such applications as tutorial quizzes where audio branching responses can be limited to alternate sets such as "yes, you were right" and "No, your were wrong," or the like. However, instantaneous audio branching to four or more audio responses is mandatory to provide audio accompaniment for more complex visual branching schemes. Attainment of such capability requires a technique other than conventional sound "track" recording. Another obstacle to providing a large number of selectable audio responses is the necessity for an inexpensive audio encoding technique which is compatible with the limited bandwith of commercially available VCR equipment. Such equipment typically has a bandwith on the order of two megahertz (MHz). The limitations posed by such limited bandwith are indicated in part by the fact that such VCR equipment typically demodulates the color subcarrier of a standard television signal to a lower frequency and then remodulates it, resulting in color degradation of the VCR output. Audio encoding must cope not only with this obstacle, but also with discontinuities present in the television signal which can cause further degradation of audio performance.

SUMMARY OF NOVEL ASPECTS OF THE INVENTION

Accordingly, it is an object of this invention to provide a branching video system with the capability of branching to a number of video pictures and audio tracks which is substantially improved over prior art techniques.

It is another object of the invention to provide means by which the viewer may interact at normal viewing distances with a video display to trigger simultaneous audio and video branching.

It is a more specific object of the invention to provide interactive video apparatus employing a cursor on the screen of a standard television set actuable at normal viewing distances.

It is a further object of this invention to provide the capability for selective, instantaneous audio branching in conjunction with all types of consumer VCR's including those of the most inexpensive type having no special performance features beyond the most rudimentary ones.

According to one aspect of the invention, a branching capability is provided by nesting the required sound "tracks" within the displayable video signal. In the preferred embodiment, each channel is encoded as an amplitude modulated pulse, and the audio signal amplitude is tapered to zero near the vertical sync interval.

User feedback is provided by means of a cursor generated on a video display. One aspect of the cursor generation is the use of an optical sensor together with circuitry responsive to the sensor output for inserting the cursor on the video display. The invention particularly provides for integrating such cursor generation with a raster scan television signal. The cursor may be used to make selections on the screen, triggering simultaneous audio and video branching.

According to another aspect of the invention, digital commands are encoded in the video signal, extracted therefrom, and utilized by a digital processor to control the apparatus. The processor is further responsive to the cursor position, manual control signals and digital commands to select particular audio or visual responses for supply to a display screen.

The foregoing remarks are intended to highlight various aspects of the invention as a preface to the detailed discussion of the preferred embodiment. The invention is defined with particularity in the claims appended hereto.

BRIEF DESCRIPTION OF THE DRAWINGS

The preferred embodiment and best mode of implementing the claimed invention will now be described in conjunction with the drawings, of which:

FIG. 1 is a schematic block diagram illustrating a system employing the preferred embodiment;

FIG. 2 is a schematic diagram illustrating the optical sensor of the preferred embodiment;

FIG. 2a and 2b illustrate waveforms related to the optical sensor output;

FIG. 3 is a waveform diagram showing audio channels embedded in a video signal;

FIG. 4 illustrates apparatus for modulating a multiplicity of audio signals onto a video signal;

FIG. 4a illustrates a technique for pre-encoding audio signals;

FIG. 5a and 5b are schematic block diagrams illustrating the functional elements of the decoder unit of the preferred embodiment;

FIG. 6 is a circuit diagram illustrating in more detail the cursor generation and control logic of the preferred embodiment;

FIG. 6a depicts waveforms further illustrative of the cursor generation;

FIG. 7 is a circuit block diagram of the data logic of the preferred embodiment;

FIG. 8 is a circuit block diagram of the video blanking logic of the preferred embodiment;

FIG. 9 is a timing diagram illustrating placement of digital commands according to the preferred embodiment;

FIG. 10 is a block diagram illustrating the audio decode logic of the preferred embodiment;

FIG. 11 is a block diagram of the score display logic of the preferred embodiment;

FIG. 12 illustrates expansion circuitry according to the preferred embodiment; and

FIG. 13 is a flowchart related to operation of the preferred embodiment.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

As shown in FIG. 1, the preferred embodiment includes a main decoder unit 12 connected between a video recorder (VCR) 14 and a hand-held controller/sensor 16. In operation, the user interacts with the images on a screen 18 of a television 32 by pointing the controller 16 at the screen 18 and pressing a button 20 when a movable on-screen cursor 22 aligns with the user's selected choice. While referenced as a "video recorder" at times herein, it will be appreciated that the recording function of the VCR 14 is not important to the subject invention but rather only the "play back" feature.

The unit 12 employs signals from the tape 110 in the VCR 14 and its own internal programming to respond to the user's selection in a number of ways. For example, the unit 12 can transmit an audible "yes" or "no" indication to the television receiver in response to the position of the cursor 22 at the moment of depression of the button 20. The user's selection can also cause the unit 12 to institute a branch to one of five audio channels, or blank part of the television screen 18.

The unit 12 further provides not only an audio and/or video indication of a correct response, but also permits verbal and visual explanations of incorrect responses. The unit 12 may further track successive responses and accumulate data necessary to provide an evaluation such as a score at the end of a selected program segment.

There are two connections 13, 155 between the decoder 12 and the video recorder 14. These are video-in 155 and audio-in 13 provided, for example, by suitable shielded cables. RF-out is provided to the television receiver 32 by a standard coaxial connector 17.

The controller 16 is connected to the decoder 12 via a four-pin connector 19. The preferred embodiment is powered by a wall-mounted, low-voltage DC output power supply. Power is connected to the decoder 12 via a coaxial DC connector.

The decoder 12 has one top-accessed long throw slide switch 24 which controls both power and an RF switch. When off, the RF from the recorder 14 is connected to the receiver of the television 32. The button 20 of the controller 16 is a tactile feedback switch.

The decoder unit 12 processes both the output signal of an optical sensor (FIG. 2) and a video signal containing nested audio (FIG. 3) and digital commands (FIG. 9.) These signals and their generation will now be described as a preface to detailed discussion of the decoder circuitry.

In order for a pointing device such as the controller/sensor 20 to be effective when used at a significant distance from the television receiver, a method of visual feedback is provided. The preferred embodiment generates a cursor 22 on the screen 18 in a positive relative to the sensed position of the raster scanning spot on the screen of the television receiver.

As shown in FIG. 2, the hand-held probe 16 contains an optical system including a single lens element 26, which images a portion of the television screen 18, and hence the raster scan spot, onto a phototransistor 28. This phototransistor 28 may be an OPT500.

The lens 26 is arranged to focus a circular screen area approximately one inch in diameter onto the television screen on a 13 inch set at a distance of five feet. The resultant output 30 of the phototransistor 28 appears as a series of pulses on a 60 Hz component, the pulses having a frequency of 15734.263 Hz for a standard U.S. color set. This output 30 is illustrated in FIG. 2a. The output 30 is then filtered to provide a pulse train as shown in FIG. 2b. If the photodetector surface area is circular, the pulses of the pulse train will typically increase in amplitude as further shown in FIG. 2b. It is therefore preferable to use one of the later pulses, e.g. the seventh, for subsequent operations.

Because the rise time of the phototransistor 28 is quite slow, the timing of the transitions in the pulses of FIG. 2b varies significantly with screen brightness and spot position on the phototransistor 28. With a typical rise time of 8 microseconds, a significant portion of the left side of the screen 18 could not be used if the cursor 22 were generated at the time of the transition in the output 30 of transistor 28. This problem is corrected in the decoder unit 12 as discussed in more detail hereafter.

The audio system consists of six processor-selectable audio sources. The first source is a direct output from a microprocessor 15 (FIG. 5b) which is used to generate preprogrammed tone sequences or melodies for correct/incorrect answers, providing immediate response to the user's choice. The second source is derived from a direct connection to the audio output 13 of the VCR 14. The audio output 13 delivers signals stored on the standard audio track of the video tape.

The other four audio sources are extracted from four amplitude-modulated pulses encoded or "nested" in the video scan according to a technique illustrated in FIG. 3. These signals are extracted by applying a selected single train of them to a fast sample and hold circuit, which is then followed by a low pass filter. The circuitry used is shown in FIG. 10 and described in more detail in connection with that figure.

FIG. 3 illustrates the nested audio pulses. The waveform of FIG. 3 covers a single horizontal sweep (line) period lasting from the beginning of horizontal sync pulse 3f to the start of the next horizontal sync pulse 3g, or approximately 63.5 microseconds in the U.S. NTSC format. The sync pulse 3f is followed by a chroma burst 3h, which consists typically of about ten cycles at 3.579545 MHz. The chroma burst signal 3h is followed by four pulses indicated in the Figure as 3a, 3b, 3c, and 3d, respectively. The first two pulses 3a, 3b are separated by video from the last two pulses 3c, 3d. The pulses vary through a range of amplitudes, as indicated by the arrows in FIG. 3, reaching maximums near the white level and minimums near the black level. The pulses 3a and 3b are preceded and succeeded by one-half microsecond return-to-median intervals 3e. These intervals 3e reduce intermodulation distortion by separating the adjacent pairs of amplitude modulated sound pulses. The one microsecond pulses 3a, 3b, 3c, 3d represent the four nested audio channels, and are each amplitude modulated by a different audio signal. Since the pulses 3a, 3b, 3c, 3d occur at the horizontal sync/scan rate of 15,734 Hz, they can readily be amplitude modulated with audio signals up to approximately 5 kHz, since that is well below the minimum 2:1 Nyquist ratio.

According to the preferred embodiment, no audio pulses are encoded during the vertical sync period. Audio pulses do occur during the vertical blanking interval and beyond, but are masked by a "video border" generated by unit 12 to prevent portions of them them from appearing as intensity modulated bars at the left and right edges of the television picture.

FIG. 4 depicts a block diagram of circuitry for generating a waveform such as that shown in FIG. 3. This circuitry places four prerecorded audio sources, e1 through e4, on a video cassette 10d. The four audio sources e1 . . . e4 are applied to respective amplitude modulators 6, 7, 8, and 9. These modulators 6, 7, 8, 9 are driven by a single pulse from counter 100 at the proper time with respect to the prior horizontal sync signal on line tb, which starts the counter 100. The counter 100 is clocked by signals on line ta to produce the proper time relationships and pulse width required to generate the pulses referred to as 3a, 3b, 3c and 3d in FIG. 3. It is the function of amplitude modulators 6, 7, 8 and 9 to deliver a single pulse to line 10b whose peak value corresponds to the instantaneous level of the applied audio waveforms e1, e3, e3 and e4, respectively. Counter 100 also enables an analog transmission gate 11 via line lg at the appropriate time so as to permit an external video signal to enter from line 10e.

Thus, the signal on line 10b represents alternately external sync and color burst signals, from line 10e, two nested audio pulses from the modulators 6,7 followed by external video signal 10e, and then, two nested audio pulses, from modulators 8 and 9, followed by the next horizontal sync pulses and so forth.

It is to be noted that the counter arrangement shown in FIG. 4 is only representative of various circuit embodiments which could be used to produce properly-timed pulses. Moreover, to reduce audio distortion in the preferred embodiment, due to the lack of amplitude-modulated pulses during the vertical sync interval, it is preferred to make the instantaneous audio waveform ending levels equal to their starting level at the beginning of the next field. This is done during the encoding process by preconditioning of the audio signals to be encoded. The method of preconditioning in the preferred embodiment is to taper the audio signal A1 just prior to the start of the vertical sync pulse V1 as illustrated in FIG. 4A. Such a waveform may be achieved in FIG. 4 by subjecting each source e1, e2, e3, e4 to a linearly decreasing gain adjustment G triggered in anticipation of the vertical sync pulse V1 and a linearly increasing gain adjustment G triggered to begin at the end of the pulse V1. Such a gain adjustment may be made by inputting the audio signal e1 etc. to a signal multiplier such as the AD 7523KN which multiplies the signal by an input count. An up/down counter then supplies the input to the multiplier at the appropriate time to generate the tapered audio waveform. Other, more sophisticated computer prediction methods may also be employed to reduce the 60 Hz vertical sync interval transient disturbance of the otherwise contiguous audio signals.

As indicated above, the optical sensor-generated signal and nested audio signal just described in connection with FIGS. 2-4a are supplied to the decoder unit 12. The preferred embodiment of the decoder unit 12 includes three major sections: the analog section 11 (FIG. 5a), the digital section 130 and the microprocessor 15 (FIG. 5b). Each section is described in detail in the following discussion.

The analog section 11 (FIG. 5a) contains the video, audio and opto-electronic sections of the preferred embodiment. The video signal "video-in" from the VCR 14 is passed through a negative clamping circuit 170, which establishes the DC reference voltage of the sync pulses.

The video signal is then amplified in the video amplifier stage 190 and directed to four other circuits: a video clamping circuit 21, a sync stripper 23, a data comparator 25 and an audio sampling switch 27.

In the video clamping circuit 21, the video signal is processed. The signal is allowed to pass unmodified or is set to one of two fixed levels under control of the digital section 13, FIG. 5b. The levels are:

1. A fixed gray level corresponding to a 10% luminance level used for background for the score and cursor displays and the quadrant and border blanking.

2. A white level used for the cursor and score displays. A standard 1-volt P-P input is assumed for establishing the blanking, gray (10%) and white levels.

The output signal of the video clamping circuit 21 is then fed to an RF modulator 29 to amplitude modulate the channel three RF oscillator of the modulator 29. The output of the RF modulator 29 is fed to an antenna switch 31 (24 in FIG. 1), which allows the user to select the television 32 as the input source.

The sync stripper 23 detects sync pulses in the video signal and passes the sync signal to the sync separator 33 of the digital section 13.

The data comparator circuit 25 converts the video signal to a digital signal corresponding to a "one" for a white level and a "zero" for a black level. The threshold of the comparator circuit 25 is set to a 50% luminance level of the video signal so that any video signal exceeding 50% is interpreted as a "one". The digital output of the data comparator 25 contains digital commands ultimately transmitted to the microprocessor 15 by the data logic 44 of FIG. 5b, shown in detail in FIG. 7.

The audio sampling switch 27 selects and decodes one of the four nested audio signals. The analog switch 27 provides an input 24 to a sample/hold amplifier 35, which in turn outputs to a low pass (5 kHz) filter 37. The input 24 to the sample/hold amplifier 35 is selected by the audio logic 39 of the digital section 130. The input 34 can be one of four sampled levels of the nested audio, the main audio track of the VCR 14, or a processor-generated series of tones delivered on line 58. The output of the low pass filter 37 is fed to the RF modulator 29 to frequency modulate the audio subcarrier of the modulator 29.

The digital section 130 of FIG. 5b is divided into the following areas:

1. Timing section 43

2. Microprocessor interface section 45

3. Cursor logic 47

4. Data logic 44

5. Video blanking logic 57

6. Audio logic 39

The digital section 130 is designed to perform all of the time dependent functions associated with the video display.

The timing section 43 includes the sync separator circuit 33 which separates the vertical and horizontal sync pulses in the input from the VCR 14 on line 62, a nine-stage line counter 57 locked to the vertical sync pulses, and a nine-stage horizontal position counter 59 locked to the horizontal sync pulses. The counters 57, 59 maintain the effective position of the raster scan beam on the television screen. They are used by the other logic sections for timing.

Timing is controlled by a phase-locked timing generator 63 including a basic 4 MHz clock. The generator 63 includes a phase lock loop reset on each vertical pulse. The end count of the loop is compared against the count of the horizontal counter 59. If required, the clock frequency is adjusted to give 255 counts per horizontal period. By phase locking to the pulses themselves, the time shift caused by the output signal instability of the playback unit 14 is accomodated. Otherwise, a much higher clock frequency would be required and would still be subject to error.

To illustrate, with a free running 4 MHz clock, the timing error would be 250 nanoseconds (ns) or=1/2 the width of the pulses 3a, 3b, 3c, 3d encoding the audio channels. Such jitter in audio decoding is unacceptable. Therefore, total jitter is phase lock corrected at 15 KHz and the error is reduced to about 40 ns.

In the embodiment disclosed, the micro-processor 15 is an Intel 8048 series processor, and the interface 45 is specific to that device. The 8048 provides for data memory expansion using the data port, with address and data transferred over a common bus. The interface section 45 consists of four eight-bit registers used for control and data transfer. These registers are:

    ______________________________________                                         Location        Register                                                       ______________________________________                                         Hex                                                                            00              Video control register 67                                      01              Audio control register 67                                      02              Cursor position register 71                                    03              Data register 73                                               ______________________________________                                    

The video and audio control registers are indicated as a single block 67 in FIG. 5b.

The video control register 67 is write-only and has the following functions:

    ______________________________________                                         Bit                                                                            ______________________________________                                         0              1 = Quadrant 1 Blanked                                          1              1 = Quadrant 2 Blanked                                          2              1 = Quadrant 3 Blanked                                          3              1 = Quadrant 4 Blanked                                          4              1 = Border On                                                   5              1 = Cursor On                                                   6              1 = Cursor Hold                                                 7              1 = Score Display On                                            ______________________________________                                    

The audio control register 67 is write-only and has the following functions:

    ______________________________________                                         Bit                                                                            ______________________________________                                         0        1 = Audio channel 1 (first video position)                            1        1 = Audio channel 2                                                   2        1 = Audio channel 3                                                   3        1 = Audio channel 4                                                   4        1 = Audio channel 5 (main audio from VCR)                             5        1 = high output for processor tones                                   6        1 = low output for processor tones                                    7        Unused                                                                ______________________________________                                    

The cursor position register 71 is a read-only register and consists of the high nibble of both horizontal and vertical cursor positions arranged as follows:

high nibble=Vertical position

low nibble=Horizontal position

The data register 73 is a read-only register containing digital data down-loaded from the VCR 14 through the data logic 44. Accompanying this register 73 is a flag from the data logic 44 indicating "data present," which is directly connected to the interrupt input Int of the processor 15. The flag is reset after a read of data from the data register 73. Another input to the microprocessor 15 comes from the cursor button 20 and is shown connected to the input T1, which enables the button output to be polled by the microprocessor 15.

The digital circuitry of FIG. 5b further includes cursor logic 47 which cooperates in generating the cursor 22 on the television screen 18. The cursor logic 47 (FIG. 5b) is shown in detail in FIG. 6. It includes pulse select logic 77 which gates the seventh input pulse (FIG. 2b) from the optical sensor 41 after the vertical sync pulse. This seventh pulse starts a two and two-thirds line delay 79. This delay ensures that the cursor generator does not interfere with incoming data which indicates the cursor position. Rise time of the detector of approximately 8 microseconds must also be accounted for. Otherwise, the cursor may appear where the position is sensed on the next frame causing an undesirable visual jump of the cursor.

The output signal of delay element 79 triggers a latch 81 and a latch 83 used, respectively, to latch the counts of the line counter 57 and horizontal position counter 59 (FIG. 5b). This latched count defines the point at which the first line of the cursor is generated. When the latches 81, 83 are triggered, this contents match the respective line and horizontal position counts, causing a positive output (match detection) from each of two comparators 85, 87. The comparator outputs are provided to a signal generator 89.

The signal generator 89 includes first and second counters 84, 86, a quad input OR gate 88, first and second AND gates 92, 94 and an inverter 90. The OR gate 88 receives inputs from the 0 and 6 outputs of each of the counters 84, 86. The AND gate 92 receives an input from the output of the OR gate 88 and from the 7 output of the counter 86. The AND gate 94 receives inputs from the 7 outputs of both counters 84, 87 and from the inverted output of the OR gate 88.

The counters 84, 86 start on a positive output from the respective digital comparators 85, 87 and stop when they reach a specific count. They will remain stopped until cleared by the next reset. The horizontal cursor counter 84 starts each time the horizontal position counter 59 is equal to the count latched by the latch 81 after the 22/3 line delay provided by the delay element 79. The horizontal counter 84 is clocked by the same clock as the position counter 59. Following the output of the latch data signal from delay 79, the horizontal counter 84 will produce a pulse on decoded output 7 at the same horizontal position on all subsequent lines. The vertical counter 86 works in the same manner for the line count. Outputs 0 and 6 of each counter 84, 86 are ORed together by OR gate 88 and then AND-ed with the vertical cursor interval to form a black rectangle on the screen. The white output occurs whenever the square is "on" (defined as both vertical AND horizontal counters enabled (counting) AND NOT black output. The signal generator 89 thus generates the first line L1 of the cursor pattern shown in FIG. 6a, and then the subsequent lines L2 . . . L6.

If the hand held device 16 is not moved, the next time the raster passes through the field of view of the optical sensor 16, the cursor generation will begin at the same point of the raster. If the position of the hand held device 16 has been adjusted, the latches 81,83 will latch a new starting position and the cursor will be generated beginning at this new starting position. Hence, the cursor 22 follows the position of the handheld device 16 as long as the latter points to some portion of the screen 18.

The handheld device 16 uses an optical pickup. In order to function correctly with the handheld device 16, the display screen 18 should be as light as possible with no black objects. This does not limit the subject matter, but rather requires attention to the contrast ratio during video production. The video signal should be adjusted to produce a minimum luminance signal of 10% of the white level. This consideration is operative only during those periods when the cursor is enabled.

The data high nibbles (four bits) from each latch 81, 83 form the new contents of the cursor position register 71. In an improved embodiment, two sets of latches could be used, one to contain the position transmitted to the processor 15 and another to contain position data from the processor 15, with the latter data actually controlling where the cursor is generated. The processor 15 would then execute a debounce or integrate routine to eliminate small variation or vibrations in the cursor position.

Two control lines 91, 93 are associated with the cursor logic. The first, 91, allows the cursor display to be turned on or off, and the second, 93, allows the cursor position to be held.

The data logic 44 (FIG. 7) consists of an eight stage shift registers 95, clock control logic 97 and flag control logic 99. Data is stored on the tape 110 of the VCR 14 and is shifted into the register 95 over line 62 and through the data comparator 25 beginning with the first line following the equalizing period (line 10) (i.e., start bit on the tenth line, first frame after equalizing pulse begin audio, and the next nine lines contain the data). This timing is accomplished by the clock control 97 which gates clock pulses to the shift register 95 as determined by the vertical line count maintained by vertical line counter 57. The shift register 95 is shifted on each horizontal sync pulse. If the first data bit inputted to the shift register 95 is a "1" (white), the data flag is set in the processor 15 indicating that the processor should read the following data from the shift register 95. The data flag to the processor 15 from register stage 99 is reset over a line 101 when the data register 73 is addressed by the processor 15. If the first data bit inputted to the register 95 is zero (black), the processor interprets it as an indication that no valid data follows. Hence, the processor 15 does not read the data register 73.

The data format is further illustrated in FIG. 9. As shown, each bit occupies substantially an entire horizontal line period. This feature enables very reliable and inexpensive data extraction. In FIG. 9, the nested audio areas of the signal are indicated generally by the numeral 3.

The video blanking logic 57 (FIG. 8) is a series of decoders 103, 105, 107, 109, 111, 113 which decode the appropriate line and horizontal position counts for blanking the various sections of the screen 18. These decoders 103, 105, 107, 109, 111, 113 supply a control signal to the video control logic 51 on an output line 114. The output 115 of the vertical blanking decoder 103 is "on" during the period from the beginning of the first audio pulse 3a to the end of the fourth audio pulse 3d and is active from count 1 of the vertical line counter 57 through count 16. The output of the screen border decoder 113 is active during the periods of the auxiliary audio signals. The output of the quadrant 1 blanking decoder 105 is active from count 16 of the line counter 57 through count 122 and from horizontal count 88 through 292. The quadrant 2 decoder 107 is active from line count 16 through 122 and from horizontal count 291 through 496. The quadrant 3 decoder 109 is active from line count 121 through 261 and from horizontal count 88 through 292. The quadrant 4 decoder 111 is active from line count 121 through 261 and from horizontal count 291 through 496. The blanking function of decoders 105, 107, 109, 111, 113 is enabled by respective control signals on lines 116, 117, 119, 121 and 123 from the control register 67.

The audio logic 39 (FIG. 10) consists of a sample and hold switch 125, for decoding the four auxiliary audio channels, and three additional switches 127, 129, 131.

The positions of the auxiliary audio signals in the video signal are indicated on the timing digram of FIG. 3. The sample windows are generated by the horizontal position counter 59 and gated by a vertical sync decoder 133. The windows are:

Audio 1 10-11 [10.25 to 10.75] microseconds, determined by decoder 135

Audio 2 11.5-12.5 [11.75 to 12.25] microseconds, determined by decoder 137

Audio 3 60.5 to 61.5 [60.75 to 61.25] microseconds, determined by decoder 139

Audio 4 62 to 63 [62.25 to 62.75] microseconds, determined by decoder 141

The bracketed times indicate the interval actually sampled. The decoders are enabled by respective audio register bits ARB 0-3 from the control register 67. The three additional switches 127, 129, 131 connect (1) the main audio derived from the VCR sound track, (2) a high level or (3) a low level to the audio output 143. Each of the three switches 127, 129, 131 is controlled by an individual bit from the audio control register 67, the latter two switches 129, 131 constituting a two step, processor-controlled volume control.

The score logic 53 of FIG. 11 includes a shift clock 183, line decoder 185, read logic 187, write logic 189 and four 64-bit latches 191. Operation of the score display logic 53 (FIG. 11) is straight forward. Data is loaded from the data registers 73 to a 32-bit shift register 181 and then shifted out under control of the horizontal position clock. At the end of a line, the next 32-bit byte is loaded and shifted out at the appropriate horizontal position. The display will begin at horizontal count 400 and be shifted at 4 MHz.

Score logic 53 may be provided to generate a block sized A×B pixels in upper right corner so that the processor 15 may take score data out of memory and place it into a shift register. Score logic 53 serves to sync each line to the raster for proper insertion in the designated score area.

As alluded to above, the processor 15 is an eight-bit microcomputer, the Intel 8048. FIG. 12 indicates a 2k×8 random access memory RAM 201 which can be optionally included in the system, as well as an address latch 203, and control logic 205. The RAM can serve as program storage or as variable storage under software control. A program can be downloaded from the VCR 14, which would replace all of the internal program memory, and would allow the software designer total flexibility within the limits of the hardware. The design allows the RAM to be easily configured as an add-on module.

As indicated above, th decoder unit 12 is controlled by 8-bit serial data transmitted on the first nine lines after the vertical equalizing pulses. The unit 12 responds to a list of commands described in the following specification:

    ______________________________________                                         1.1 Commands:   (xx) = Hex                                                     ______________________________________                                         0.  Reset (00)        16.   Clear (10)                                         1.  Enable Selection (01)                                                                            17.   Disable Selection (11)                             2.  Branch On Previous (02)                                                                          18.                                                      3.  Branch On Score (03)                                                                             19.                                                      4.  Enable Score (04) 20.   Disable Score (14)                                 5.  Border On (05)    21.   Border Off (15)                                    6.  Cursor On (06)    22.   Cursor Off (16)                                    7.  Score Display On (07)                                                                            23.   Score Display Off (17)                             8.  Selection latch On (08)                                                                          24.   Selection Latch Off (18)                           9.  Enable Tune (09)  25.   Disable Tune (19)                                  10. Video Branching On (0A)                                                                          26.   Video Branching Off (lA)                           11. Sequential Mode On (0B)                                                                          27.   Sequential Mode Off (lB)                           12. Define Area 1 yes (0C)                                                                           28.   Define Area 1 no (1C)                              13. Define Area 2 yes (0D)                                                                           29.   Define Area 2 no (1D)                              14. Define Area 3 yes (0E)                                                                           30.   Define Area 3 no (1E)                              15. Define Area 4 yes (0F)                                                                           31.   Define Area 4 no (1F)                              ______________________________________                                    

1.2 Command Definition:

1.2.0 Reset:

resets answer memory

enables score

turns border off

turns cursor on

turns score display off

turns selection latch on

enables tune

turns video branch mode off

turns sequential mode off

resets the 4 defined areas to no, 0, 0, 0, 0.

and performs a clear command

1.2.1 Enables Selection:

enables select button on controller

1.2.2 Branch On Previous:

Branches audio (or both audio and video if video branching on) based on the last year selection.

1.2.3 Branch on Score:

(two byte instruction) Causes a branch based on the number of incorrect responses made after a reset command. If zero incorrect, a branch is made as if a selection 1 has been made. The second byte defines the increments of incorrect responses required for other branches. If the second byte is 1, the branches will be:

    ______________________________________                                                # incorrect                                                                            selection                                                       ______________________________________                                                0       1                                                                      1       2                                                                      2       3                                                                      3       4                                                                      4 or more                                                                              5                                                               ______________________________________                                    

For a second byte of 2 the branches will be:

    ______________________________________                                                # incorrect                                                                            selection                                                       ______________________________________                                                0       1                                                                      1-2     2                                                                      3-4     3                                                                      5-6     4                                                                      7 or more                                                                              5                                                               ______________________________________                                    

For a second byte of 3 the branches will be:

    ______________________________________                                                # incorrect                                                                            selection                                                       ______________________________________                                                0       1                                                                      1-3     2                                                                      4-6     3                                                                      7-9     4                                                                      10 or more                                                                             5                                                               ______________________________________                                    

1.2.4 Enable Score:

Enables score keeping counter

1.2.5 Border On:

Turns on the screen border to mask auxiliary audio channels information from the viewer.

1.2.6 Cursor On:

Enables the on-screen cursor generated by the sensor.

1.2.7 Score Display On:

Displays 5 digit score in upper left corner of screen. (used only with expansion module)

1.2.8 Selection Latch On:

Allows only one selection to be made per branch segment. Selection remains latched until clear command is received.

1.2.9 Enable Tune:

Causes decoder to generate one of two hardware defined tunes based on the selection definition being "yes or no".

1.2.10 Video Branching Mode On:

(two byte instruction) At the time of the disable selection command, causes blanking of non-selected area. The area and pattern are defined by the second byte.

The second byte is divided into a format (D3 and D2) and an offset (D1 and D0).

The formats are:

    ______________________________________                                         D3     D2                                                                      ______________________________________                                         0      0           four quadrants, all selectable                              0      1           four quadrants, quadrant four                                                  always on.                                                  1      0           two sections, right-left                                    1      1           two sections, top-bottom                                    ______________________________________                                    

The offset is a number from 0 to 3 which is added to the selection number to locate the area to be displayed.

1.2.11 Sequential mode on:

allows branch based on time of the selection button being pressed. Four branches are allowed, the selection counter is incremented by an enable selection instruction. Both the audio and video branch (if enabled) occur on execution of a disable selection. The second byte of the instruction specifies which answer is correct, the correlation between branches and the position in the sequence.

    ______________________________________                                         Second byte    sequence #                                                      ______________________________________                                         0.             1,2,3,4                                                         1              2,3,4,1                                                         2              3,4,1,2                                                         3              4,1,2,3                                                         ______________________________________                                    

1.2.12 Define Area 1 yes:

(five byte instruction) defines area of screen which causes an audio branch to track 1. Defines selection as #1 for Video Branching, increments correct score

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.13 Define Area 2 yes:

(five byte instruction) defines are of screen which causes an audio branch to track 2. Defines selection as #2 for Video Branching, increments correct score.

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.14 Define Area 3 yes:

(five byte instruction) defines area of screen which causes an audio branch to track 3. Defines selection as #3 for Video Branching, increments correct score.

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.15 Define Area 4 yes:

(five byte instruction) defines area of screen which causes an audio branch to track 4. Defines selection as #4 for Video Branching, increments correct score

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.16 Clear:

Returns from branch. Audio channel 5 (main) is selected and the fully screen is displayed. This command also resets the video branch to off, resets the sequential mode off, and turns the cursor on.

1.2.17 Disable Selection:

disables select button

This command terminates the answer period. The video branch is made at this time. If the sequential mode is enabled, the audio branch will also occured at this time.

1.2.18 Not used.

1.2.19 Not used.

1.2.20 Disable Score:

Disables the score counter until the next clear command. Used in the same manner as Disable tune.

1.2.21 Border off:

allows software to remove border, for full screen display. Audio branching is disabled.

1.2.22 Cursor off:

allows software to disable cursor. Cursor is re-enabled by executing a "clear".

1.2.23 Score display off:

Turns off score display. Used only with expansion module.

1.2.24 Selection latch off:

allows software to enable multiple answers. Selection is latched until another selection is made. The last selection made prior to the disable selection command will be used for score keeping and branch selection for the explanation phase.

1.2.25 Disable Tune:

Disables "correct/incorrect" tunes after selection is made. When disabled, the decoder produces a single tone when the select button is depressed. This command is used whenever a question does not have a right or wrong response but is merely a choice.

1.2.26 Video Branching Off:

Turns video branching off Audio branching remains active.

1.2.27 Sequential Mode Off:

Returns from sequential mode

1.2.28 Define Area 1 no:

(five byte instruction) defines area of screen which causes an audio branch to track 1. Defines selection as #1 for Video branching. Increments incorrect response counter.

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.29 Define Area 2 no:

(five byte instruction) defines area of screen which causes an audio branch to track 2. Defines selection as #2 for Video branching. Increments incorrect response counter.

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.30 Define Area 3 no:

(five byte instruction) defines area of screen which causes an audio branch to track 3. Defines selection as #3 for Video branching. Increments incorrect response counter.

Byte 2: Start of horizontal area

Byte 3: End of horizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

1.2.31 Define Area 4 no:

(five byte instruction) defines area of screen which causes an audio branch to track 4. Defines selection as #4 for Video branching. Increments incorrect response counter.

Byte 2: Start of horizontal area

Byte 3: End of hoizontal area

Byte 4: Start of vertical area

Byte 5: End of vertical area

The preferred embodiment is a question/response device, with the response being software programmable. The following discussion illustrates several types of question/response formats and the sequence of commands derived from the pre-recorded video tape of the VCR 14, required to produce the desired response.

MULTIPLE CHOICE FORMAT

One format is a multiple choice format, with audio branching only. In this format there is a preamble during which a question or choice is raised, a description of the required action is made, a question posed, a period of time to make a selection allotted, and a response made based on the user input.

A sample script might read as follows: Narrator: How good are you at identifying farm animals . . . I'm going to show you a picture of a barnyard and ask you to pick out an animal. Let's find the cow first. When you point the sensor at the screen, a screen, a little white square will appear on the screen. Move the square over the animal you think is the cow and press the bottom. CUT TO BARNYARD containing a cow, pig, and a horse: barnyard background sounds. At the time the button is pressed, the unit 12 executes a branch to one of the five audio tracks available.

Audio #1: (if cursor in area defined as "cow") You are right, that is the cow.

Audio #2: (if cursor in area defined as "pig") That was the pig, better luck next time.

Audio #3: (if cursor in area defined as "horse") Close, but that is a horse.

Audio #4: (if button bushed, but cursor not in any of the above areas) You have to point more carefully, I didn't understand.

Audio #5 (Main audio): I can't tell what you're thinking, you need to point and push the button. Narrator: Discusses the attributes of a cow.

The above question/response can be tailored to any age group, and is intended to verbalize the multiple choice mode.

The sequence of program commands required to implement the foregoing script is:

1. Reset: This command resets the decoder 12 to the initial starting conditions. For a more detailed description refer to section 1.2.0 and 1.2.16.

2. Define #1 yes: This command is used to set the screen position of object number 1 and indicate area 1 is a correct response. Note that the screen is divided into 256 zones, 16 horizontally and 16 vertically. The target area is defined by 4 sets of numbers (0-15) which define a rectangular area of the screen. The numbers define the starting and ending point horizontally and vertically. The code 0,1,0,1 set the target to the upper left hand corner of the screen, while 0,15,0,15 would define the entire screen. Thus, it can be seen that up to 256 "window shades" can be placed over, and hide, background VCR video (pictures) in any combination, such as 4×4 quadrants, etc.

3. Define #2 no: This sequence of commands defines the second object and also specifies that the area is an incorrect response. It is important to note that the areas defined may overlap. When a decision is latched by the decoder (when the button is depressed), the microprocessor 15 will determine if the cursor position is within the area of object 1, 2, 3, or 4 in that order. If, for example, the cursor is pointed to an area which is within both 1 and 2, the unit will select #1.

4. Define #3 no: Defines object #3 and specifies the area as an incorrect response.

5. Define #4 no: Defines object 190 4. In the above example, object #4 is the full screen and is also defined as an incorrect response.

To transmit the above list of commands will require approximately 0.35 seconds. These commands may be loaded on the pre-recorded VCR tape 110 at any time during the lead-in to the question. It is desirable that the commands be as close as possible to the enable selection command to avoid incorrect response if the tape is rewound to repeat the question. It is also recommended that each segment be preceded by at least 5 reset or clear commands, in case the VCR 14 is stopped during the playback of a multi-word command. It is not required to load areas which are not used.

6. Enable Selection: This command allows the button 20 on the sensor 16 to select a target. Normally, the selection is latched, until the next clear command. If this is not desired, and the software designer would like to allow several choices, a Selection Latch Off command is inserted into the tape, priorto the enable selection command.

During the selection period the objects can move but must remain within the selection area defined above. It is possible to redefine a target area at any time, however, it will take on the order of 0.083 seconds to down load new data. If the button 20 is pushed during the selection period, the decoder 12 will:

(A) latch (hold) the last cursor position, and the cursor will blink.

(B) Play an audible tune through the T.V. speaker by programming in the "Enable Time" command. This tune will be a "correct" song if the selected area is designated as correct, and an "incorrect" tune if not. The tunes are hardware defined, not software programmable. If these tunes are not desired, they can be disabled by encoding a "Tune Off" command during the preamble to the question.

(C) Branch to the audio track determined by the area definition statements.

(D) Increment the internal score keeping register based on the area definitions. This action can be disabled by encoding a "Disable Score" command.

In most cases, all 5 audio tracks will contain the same audio, until the end of the selection period, at which time the explanations are given.

6. Disable Selection: This command disables the select button and is inserted to prevent an audio branch during the explanation phase.

7. Clear: This command returns the audio to channel 5 and terminates the branch.

The multiple choice format can be used for as few as one choice, such as might be used in a sound matching type of question. In this case the correct response would be to press the select button 20 or not press the select button 20. A correct segment is coded with:

1. a clear command

2. define area 1 yes 0,15,0,15 (full screen)

3. enable select

4. disable select

5. clear

An incorrect segment uses the "Define Area 1 No" command as follows:

1. clear command

2. define area 1 no 0,5,0,15 (full screen)

3. enable selection

4. disable selection

5. clear

VIDEO BRANCHING FORMAT

Another format available in the Video Branching Format (branching audio and pseudo branching video): The video branching format operates in the same manner as the multiple choice format but allows the video to perform a pseudo branch as well. This is accomplished by blanking (placing "window shades" over) those quadrants of the screen not related to the choice made by the viewer.

Using the same script example as above, an explanation may be presented as to why answers were right or wrong, pointing out the differences between the operator's choice and the correct response, as before, but displaying appropriate quadrant-sized visuals.

The commands are the same as in the initial example with the addition of a "Video Branch Mode On" command and would be:

1. clear

2. define area 1 yes

3. define area 2 no

4. define area 3 no

5. define area 4 no

6. video branch mode on

7. enable selection

8. disable selection

9. clear

The Video Branch Mode On command has several options, which allow the screen to be blanked in several ways.

The first is four quadrants. In this mode, only one quadrant is unblanked at any one time, the area being defined by the selection will be displayed in the following manner: selection 1 top left, selection 2 top right, selection 3 bottom right, and selection 4 bottom left. If no selection is made the entire screen will be blanked.

The above command may be modified by specifing an offset between 0 and 3. This offset is added to the selection number to determine the area displayed. If the selection were 2 and the offset 1, the bottom right quadrant would be displayed.

The second is four quadrants with quadrant 4 always on. This is used if the software designer would like to have a continuous video display for prompting. This command is also altered by specifying an offset. The offset changes both the selection and the position of the prompting quadrant.

The third possibility is a vertically split screen. In this mode the left side of the screen is defined as choice 1 and the right side as choice 2. If an odd offset is used, the right side will be displayed on selection 1.

The fourth option allows the screen to be split horizontally with the top defined as selection 1 and the bottom defined as selection 2. If an odd offset is used, the bottom of the screen will be displayed on selection 1.

The screen is blanked as soon as the disable selection command is received and remains blanked until the clear command is received. As in the standard multiple choice format, the audio is branched to the selection number at the time the selection is made.

SEQUENTIAL FORMAT

Another available format is a sequential format. The sequential format does not require the cursor 22. In this format the answers to a question are presented serially. The user presses the select button during the time the correct response is shown, and the branch is then made at the end of the segment (when the disable selection command is given).

The sequence of commands is:

1. clear

2. sequential mode on

3. enable selection

4. enable selection

5. enable selection

6. enable selection

7. disable selection

8. clear

Each time the enable selection command is encountered the choice counter is incremented. Part of the sequential mode command specifies which answer is correct. Sec. 1.2.11 above.

The preferred embodiment provides a method of score keeping which is activated by commands encoded on the video tape 110.

The "Branch On Score" command is a directed branch based on the number of incorrect selections made after the last reset command. A second part of the instruction specifies the range associated with each branch. Refer to section 1.2.3. for further explaination.

The Branch On Score command can include both audio and video branching and follows the same type of format except that the user's score determines the branch rather than the user's response. A typical command sequence would be:

1. clear

2. branch mode on

3. branch on score

4. reset

Each audio track would contain specifics relating to the score such as "Perfect, you got them all," while the screen would display a quadrant with a gold medal, etc. This function is entirely software definable within the 5 audio and 4 video branch limitation.

There are several additional commands which are available to the software designer.

The first set of commands are the Latch Mode On and Latch Mode Off commands. These commands typically precede a branch. If the Latch Mode Off command is given, the cursor 22 will remain under user control, and the audio and video branches will be based on the last cursor position selected before the disable selection command is received. This allows the user to make many choices. The "yes" or "no" tune is played at the time of selection but both audio and video branches will not occur until the disable selection command is received. Score keeping is based on the last selection made. The function is cancelled by a Clear or a Reset command.

The second set of commands allows the software designer to control the cursor 22. It may be desirable during certain portions of a presentation to disable the cursor 22. The cursor 22 may be turned "on" or "off" with the appropriate command. The cursor 22 is turned "on" after a reset or clear command.

The third set of commands allows the border to be controlled when the 4 additional audio channels are present. As the audio channels occur only during branches, this command allows the software designer to eliminate the border when it is not required and re-establish it when needed. A reset or clear command will turn the border off.

The exact sequence of commands used to set-up the system for a specific question or branching segment is not important. Each command will be executed on the completion of transmission and should be synchronized to the visual display. In practice, the software designer should provide a list of the commands required for a particular branching segment.

The programming of the processor 15 is straight forward, being designed primarily to execute the commands described above. As indicated in FIG. 13, the main program loops, awaiting an interrupt from the circuitry of FIG. 7 to indicate to it that new digital commands are present. After the processor 15 is interrupted, it retrieves the command, and the program jumps down to a routine for executing the command.

As discussed above, areas are "defined" on the screen for specific modes of operation by encoding appropriate commands on the video tape played back by the VCR 14. The VCR program further determines the timing of an audio-only branch, the timing of a video branch, and the format of the video branch. A command is also programmed onto the tape and executed which enables the processor 15 to detect button depression. After execution of this command, the button 20 is active. Thus, the Enable command is inserted in the tape at a very precise point, corresponding to the video frame upon which a selection is to be made. When that screen display disappears, the button 20 is disabled by the appropriate "Disable Selection" command, and a video branch is made based on the selection indicated by the location of the cursor 22 upon depression of the button 20.

Thus, processor 15 decides which command in the software specification has been received through the register 95 and activates appropriate circuity for executing the command. When the processor 15 receives "00", it performs a reset, and initializes conditions. If the processor 15 receives a "1", it executes the command which follows as defined in the software specification above. The commands are executed immediately in real time and do not require any significant amount of execution time to execute. Hence, the commands are placed on the VCR tape 110 at the time they are to occur. Once a command is executed, the processor 15 typically returns to its looping mode.

For example, after executing the ENABLE BUTTON command, the processor 15 waits for new data or depression of the button 20. If the button 20 is triggered, the processor 15 latches the position of the cursor 22 and compares it to one of the four positions defined on the screen 18 by the instructions contained on the video tape 110.

Assume the sensor 16 is pointed to area one, corresponding to audio channel one. When the button 20 is depressed, the processor 15 sets the audio reg bit 1 (FIG. 10), causing the audio corresponding to area one to be played. (Branching is independent of whether the answer is right or wrong.) In such a case, the audio going into the television receiver 32 comes from the first channel. The processor 15 then waits for the next command. The next command which should be supplied from the tape 110 is a "disable selection" -no longer able to make choices- and do a video branch. The processor 15 then activates all but of the Quad control signals (FIG. 8) to blank various areas on the screen. This blanking will continue until the processor 15 gets a "clear" command, which says "return to the main audio channel, and return the screen to full." In such a case, the processor 15 responds by activating the appropriate audio and video control signals. Additionally, after the button 20 is pressed, the processor 15 will play the correct tune if the answer is correct, and and a "wrong tune or message" if the viewer's selection is incorrect. This step is software controllable in the sense that it maybe skipped by putting a command in the video tape 110 which instructs the processor 15 not to play a message.

The processor 15 also accumulates a score by counting the number of wrong answers and may also execute a branch-on-score command. For example, the processor 15 may decide which of the five possibilities is correct and may produce responses such as "you missed between 7 and 9." 

What is claimed is:
 1. Video apparatus comprising:applying means for applying a raster scan format television signal wherein a plurality of digital commands and audio segments are encoded as amplitude modulated pulses between successive horizontal sync intervals; a raster scanned video display employing a raster scan spot; means for generating a selection signal; means responsive to first control signals for modifying said television signal and supplying a modified television signal to said video display; cursor generation means for optically sensing a portion of said video display remotely, detecting the raster scan spot and generating second control signals for producing a cursor at a selected position of said display means in response to detection of the raster scan spot; means receiving said video information for separating said digital commands from said video information and supplying them as an output; means receiving said video information and responsive to third control signals for separating a selected one of said plurality of audio segments from said video information; and digital processor means responsive to said selected position of said cursor at the time of generation of said selection signal and to the digital commands outputted by said means for separating said digital commands to generate one or more of (1) first control signals for causing said modifying means to blank a selected portion of said video display and (2) third control signals for causing said means for separating said audio segments to separate a particular said audio segment.
 2. The apparatus of claim 1 wherein said cursor generation means further includes:photodetector means for generating an output pulse signal in response to detection of said raster scan spot; means for generating a horizontal and vertical position count from said television signal; and means responsive to said output pulse signal for latching said horizontal and vertical position count.
 3. The apparatus of claim 2 wherein said processor means determines said selected position from the latched horizontal and vertical position counts.
 4. The apparatus of claim 1 wherein said digital commands are encoded as binary level pulses, one bit only being encoded in the interval between adjacent horizontal sync pulses and occupying substantially the entire said interval not allocated to encoding audio segments so as to minimize decoding error.
 5. The apparatus of claim 1 wherein said audio segments are encoded in said video information according to the following format:a first return to median level interval; a first audio carrier pulse modulated above and below said median level; a second return to median level interval; a second audio carrier pulse modulated above and below said median level; a third return to median level interval; the first and second carrier pulses and first, second and third intervals being nested between a horizontal synchronization pulse and video data.
 6. The apparatus of claim 5 wherein said audio segments are modified near the occurrence of vertical sync intervals to reduce distortion.
 7. The apparatus of claim 5 wherein said means for separating said audio segments includes sample and hold circuit means for performing an instantaneous pulse-amplitude storage operation on each said audio modulated pulse and wherein said digital processor means selects the sample and hold interval corresponding to a desired audio segment.
 8. The apparatus of claim 1 wherein said applying means is a video cassette recorder and tape having an available bandwith on the order of two to two and one-half (2-2.5) Megahertz.
 9. The apparatus of claim 1 wherein said digital processor means responds to a command which defines an area of said display which causes a branch to a selected audio channel.
 10. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded by pulse amplitude modulation of said video signal; second means for modifying and decoding said video signal; means for applying said video signal to said second means; means for displaying the video signal after decoding and modification by said second means for observation by and interaction with a human viewer; wherein said video signal is in raster scan format; wherein said audio and digital information is encoded between successive horizontal sync pulses of said raster scan format wherein said digital information is encoded as binary level pulses; and wherein each of a plurality of said pulses occupies substantially the entire interval between horizontal sync pulses not allocated to encoding said audio information.
 11. The apparatus of claim 10 wherein said means for delivering a video signal comprises a video cassette and said means for applying a video signal comprises a video cassette playback unit.
 12. A system comprising:first means for delivering a raster format video signal including encoded audio and digital information and wherein the digital information is encoded as a multiplicity of binary level pulses only one such binary level pulse being encoded during the interval between successive horizontal sync pulses of said video signal and occupying substantially the entire position of said interval not allocated to encoding audio information; second means for modifying and decoding said video signal in response to said digital information; means for applying said video signal to said second means; and means for displaying the decoded and modified video signal for observation by a human viewer.
 13. A system comprising:first means for delivering a video signal including audio information; second means for modifying and decoding said video signal; means for applying said video signal to said second means; means for displaying the decoded and modified video signal for observation by and interaction with a human viewer; and wherein said audio information is encoded in the video signal as follows:a first return-to-median level interval; a first audio carrier pulse modulated above and below said median level; a second return-to-median level interval; a second audio carrier pulse modulated above and below said median level; a third return-to-median level interval; the first and second carrier pulses and first, second, and third intervals being nested between a horizontal synchronization pulse and video data.
 14. Interactive video apparatus for use with a television receiver and associated screen comprising:means for transmitting a television signal containing digital commands and a plurality of video signals and audio channels; means for selectively blanking a plurality of areas of said screen; means receiving said television signal for selecting one of said audio channels; means for generating a cursor on the television screen; data logic means receiving said television signal for detecting valid digital commands and supplying said valid digital commands as an output; and digital processor means for receiving said digital commands from said data logic means and responsive to said valid digital commands and said cursor for controlling said means for selectively blanking and said means for selecting to cause one or more of (1) blanking of a particular area of said screen (2) transmission of one of said audio channels to said receiver.
 15. Video apparatus comprising:applying means for applying a first signal including video information, said video information including a raster scan format television signal and a plurality of digital commands and audio segments encoded in said signal; a raster scanned video display; means for generating a selection signal; means responsive to first control signals for modifying said television signal to generate a modified television signal and for supplying said modified television signal to said display; light detection means for producing an output signal in response to detection of the raster scan spot; means for generating a horizontal position count and vertical position count from said television signal; means receiving said video information for separating said digital commands from said video information and supplying them as an output; means receiving said video information and responsive to second control signals for separating said audio segments from said video information; means responsive to said output signal for latching said horizontal position count and vertical position count; comparison means for comparing said latched horizontal position count and vertical position count to the present horizontal position and vertical position counts and for supplying first control signals to said modifying means to generate a cursor on said video display; and digital processor means responsive to the latched position counts at the time of generation of said selection signal and to the digital commands output by said means for separating said digital commands to generate one or more of (1) first control signals for causing said modifying means to blank a selected portion of said video display and (2) second control signals for causing said means for separating said audio segments to select a particular audio segment.
 16. An interactive, audio and video branching apparatus comprising:applying means for applying a raster scan format television signal wherein a plurality of digital commands and audio segments are encoded as amplitude-modulated pulses between successive horizontal sync intervals; a raster scanned video display; means for generating a selection signal; means responsive to first control signals for modifying said television signal and supplying a modified television signal to said video display; cursor generation means for optically sensing a portion of said video display remotely, electrooptically detecting a segment of the raster and generating first control signals for producing a cursor at a selected position on said display means; means for receiving said video information, for separating said digital commands from said video information and supplying them as an output; means receiving said video information and responsive to second control signals for selectively separating said audio segments from said video information; and digital processor means responsive to said selected position of said cursor at the time of generation of said selection signal and to the digital commands output by said separating means to generate one or more of (1) first control signals for blanking a selected portion of said video display and (2) second control signals for selecting a particular audio segment to be separated so as to effect an associative audio and video branching action in said interactive audio and video branching apparatus.
 17. The apparatus of claim 16 wherein said television signal comprises a television raster pulse signal and wherein said cursor generation means further includes:photodetector means for generating an output pulse train in response to detection of an imaged raster scan segment; means for supplying an imaged raster scan segment to said photo detector means; means for generating a horizontal and vertical position count from said television raster signal; means responsive to said raster pulse signal for latching said horizontal and vertical position count.
 18. The apparatus of claim 17 wherein said raster scan format includes a horizontal synchronization signal and said audio segments are encoded in said video information according to the following format:one or more pulses located near the horizontal synchronization signal, said pulse or pulses having a no-modulation mean level of 50% of peak white; and audio amplitude modulation of said pulse or pulses up to plus or minus 100% of said no-modulation mean level.
 19. The apparatus of claim 16 wherein said audio modulated pulses are modified during their occurrence near vertical sync intervals to reduce distortion.
 20. The apparatus of claim 16 wherein said means for separating said audio segments includes sample and hold circuit means for performing a storage operation on each said audio modulated pulse.
 21. Video apparatus comprising:applying means for applying a raster scan format television signal wherein a plurality of digital commands and audio segments are encoded as amplitude modulated pulses between successive horizontal sync intervals; a raster scanned video display employing a raster scan spot; means for generating a selection signal; means responsive to first control signals for modifying said television signal and supplying a modified television signal to said video display; cursor generation means for optically sensing a portion of said video display remotely, detecting the raster scan spot and generating second control signals for producing a cursor at a selected position on said display means in response to detection of the raster scan spot; meand receiving said video information for separating said digital commands from said video information and supplying them as an output; means receiving said video information and responsive to third control signals for separating said audio segments from said video information; and digital processor means responsive to said selected position of said cursor at the time of generation of said selection signal and to the digital commands output by said separating means to generate one or more of (1) said first control signals and (2) said second control signals.
 22. Video apparatus comprising:applying means for applying a first signal including video information, said video information including a raster scan format television signal and a plurality of digital commands and audio segments encoded in said signal; a raster scanned video display; means for generating a selection signal; means responsive to first control signals for modifying said television signal and for supplying a modified television signal to said display; light detection means for producing an output signal in response to detection of the raster scan spot; means for generating a horizontal position count and vertical position count from said television signal; means receiving said video information for separating said digital commands from said video information and supplying them as an output; means receiving said video information and responsive to second control signals for separating said audio segments from said video information; means responsive to said output signal for latching said horizontal position count and vertical position count; comparison means for comparing said latched horizontal position count and vertical position count to the present horizontal position and vertical position counts and for supplying first control signals to said modifying means to generate a cursor on said video display; and digital processor means responsive to the latched position counts at the time of generation of said selection signal and to the digital commands output by said separating means to generate one or more of (1) said first control signals and (2) said second control signals.
 23. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded as a multiplicity of pulse-amplitude-modulated audio and digital signals; second means for modifying and decoding said video signal to produce a decoded and modified video signal and for generating a cursor on said display means; means for applying said video signal to said second means; means for displaying the decoded and modified video signal for observation by and interaction with a human viewer; and wherein said cursor generation means further includes: handheld optical sensing means for varying the position of said cursor; and means for generating a selection signal in accordance with the perceived position of said cursor, and inputting said selection signal to said second means; and wherein said second means includes a processor means responsive to a signal related to the position of said cursor on said display means at the time of generation of said selection signal and to at least one said digital command to select one or more of (1) a selected portion of said display means for display and (2) a selected audio segment as a sound output.
 24. The apparatus of claim 23 wherein said modification means further includes data logic means for separating said digital commands from said video signal and supplying them to said processor means.
 25. The apparatus of claim 24 wherein said processor means responds to a command which defines the area of the display means which causes an audio branch to a selected audio segment.
 26. The apparatus of claim 25 wherein the response to a selection signal is software programmable.
 27. The apparatus of claim 26 wherein software programmable formats include a multiple-choice format, video-branching format and sequential format.
 28. The apparatus of claim 26 wherein said processor means further responds to a branch on score command.
 29. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded as a multiplicity of pulse-amplitude-modulated audio and digital signals; second means for modifying and decoding said video signal to produce a decoded and modified video signal; means for applying said video signal to said second means; means for displaying the decoded and modified video signal for observation by and interaction with a human viewer; wherein said second means includes a digital processor means responsive to generation of a selection signal and at least one digital command to select one or more of (1) a selected portion of said display means for display and (2) a selected audio segment as a sound output; and wherein said audio information is encoded in the video signal as follows:a first return-to-median level interval; a first audio carrier pulse modulated above and below said median level; a second return-to-median level interval; a second audio carrier pulse modulated above and below said medium level; a third return-to-median level interval; the first and second carrier pulses and first, second, and third intervals being nested between a horizontal synchronization pulse and video data.
 30. A system comprising:first means for delivering a video signal; second means for decoding said video signal and for modifying said video signal in response to first control signals; means for applying said video signal to said second means; means for displaying the decoded and modified video signal for observation by and interaction with a human viewer; cursor generation means for optically sensing a portion of said display means and supplying a first control signal to said second means for generating a cursor on said display means; and wherein said display means comprises a raster scanned television receiver and wherein said cursor generating means comprises: means for maintaining a horizontal position count and a vertical line count; an optical sensor means for sensing a portion of the television screen and generating a series of pulses, one pulse being generated each time the horizontal raster spot trace enters the field of view of said optical sensor; and means responsive to one of said pulses and said horizontal position count and vertical line count for generating said first control signals.
 31. An interactive video apparatus including a display means comprising:means for delivering a video signal wherein at least four channels of audio and a plurality of video programs are encoded; and means for selecting one of said audio channels for playback in response to user-initiated branching to a selected one of said plurality of video programs for display on said display means, said means for selecting including an optical sensor means for visually interacting with said display means and causing said user-initiated branching.
 32. The interactive video apparatus of claim 31 wherein each audio channel relates to a particular one of said plurality of programs.
 33. The interactive video apparatus of claim 32 wherein said plurality of programs comprises four video programs encoded in said video signal.
 34. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded by pulse amplitude modulation of said video signal; second means for modifying and decoding said video signal; means for applying said video signal to said second means; means for displaying the video signal after decoding and modification by said second means for observation by and interaction with a human viewer; and wherein said second means includes means for generating a cursor on said display means; said cursor generation means including:handheld optical sensing means for varying the position of said cursor; and means for generating a selection signal in accordance with the perceived position of said cursor and inputting said selection signal to said second means.
 35. The apparatus of claim 34 wherein said second means includes a processor means responsive to a signal related to the position of said cursor on said display means at the time of generation of said selection signal to select one or more of (1) a selected portion of said display means for display and (2) a selected audio segment as a sound output.
 36. The apparatus of claim 34 wherein said second means includes a processor means responsive to a signal related to the position of said cursor on said display means at the time of generation of said selection signal and to at least one said digital command to select one or more of (1) a selected portion of said display means for display and (2) a selected audio segment as a sound output.
 37. The apparatus of claim 36 wherein said modification means further includes data logic means for separating said digital commands from said video signal and supplying them to said processor means.
 38. The apparatus of claim 34 wherein said first means for delivering comprises a video cassette and said means for applying comprises a video cassette playback unit.
 39. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded by pulse amplitude modulation of said video signal; second means for modifying and decoding said video signal; means for applying said video signal to said second means; means for displaying the video signal after decoding and modification by said second means for observation by and interaction with a human viewer; and wherein said second means includes means for generatng a mark on said display means by modifying said video signal, including a pointing device and means for generating said mark at the perceived pointing position of said pointing device.
 40. The apparatus of claim 39 wherein said first means for delivering comprises a video cassette and said means for applying comprises a video cassette playback unit.
 41. A system comprising:first means for delivering a video signal wherein audio and digital information is encoded by pulse amplitude modulation of said video signal; second means for modifying and decoding said video signal; means for applying said video signal to said second means; means for displaying the video signal after decoding and modification by said second means for observation by and interaction with a human viewer; wherein said video information includes a raster scan format television signal, wherein said display means comprises a raster scanned video display; and wherein said second means includes means for generating a selection signal in response to a manual input and for correlating the generation of said selection signal to the scan position of the raster scan of said video display; said means for generating and for correlating means comprising:means for modifying said video signal to generate a mark on said screen; light detection means for detecting the raster scan spot and for producing an output signal in response to detection of the raster scan spot; and means responsive to said output signal for adjusting the position of said mark on said display means.
 42. The apparatus of claim 41 wherein said means for adjusting is further responsive to the position of said raster scan spot at the time of occurrence of said output signal to adjust the position of said cursor.
 43. The apparatus of claim 42 wherein said means for adjusting determines the position of said raster scan spot utilizing counts of the horizontal and vertical synchronization pulses of said raster scan format television signal.
 44. The apparatus of claim 41 wherein said means for delivering comprises a video cassette and said first means for applying comprises a video cassette playback unit.
 45. An interactive video apparatus including a display means comprising:means for delivering a video signal wherein at least four channels of audio and a plurality of video programs are encoded; means for selecting one of said audio channels for playback in response to user-initiated branching to a selected one of said plurality of video programs for display on said display means; and wherein said video signal includes a horizontal synchronization signal and said audio channels are encoded in said video information according to the following format:one or more pulses located near the horizontal synchronization signal, said pulse or pulses having a no-modulation mean level of 50% of peak white; and audio amplitude modulation of said pulse or pulses up to ±100% of said no-modulation mean level. 